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The unknown nentrino mass hierarchy - whether the mass eigenstate is the heaviest or the 
lightest - represents a major gap in our knowledge of neutrino properties. Determining the hierarchy 
is a critical step toward further precision measurements in the neutrino sector. The hierarchy is also 
central to interpreting the next generation of neutrinoless double beta decay results, plays a role 
in numerous cosmological and astrophysical questions, and serves as a powerful model discriminant 
for theories of neutrino mass generation and unification. Various current and planned experiments 
claim sensitivity for establishing the neutrino mass hierarchy. We review the most promising of these 
here, paying special attention to points of concern and consolidating the projected sensitivities into 
an outlook for the years ahead. 


I. INTRODUCTION 


In the Standard Model with non-zero neutrino masses, the three neutrino flavor eigenstates Vf,, and Vt are 
non-trivial linear combinations of the three mass eigenstates i>i, 1 ^ 2 , and t'a. The complex matrix U relating the flavor 
and mass eigenstates to each other, called the PMNS matrix for the authors of Refs. can be parametrized in 

terms of three rotation angles and three complex phases: 

/ 1 0 0 \ / C13 0 si3e-*'5 \ / C12 S12 0 \ / 0 0 \ 

c/ = 0 C23 S23 0 1 0 -S12 C12 0 0 0 , (1) 
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with Cij= cos 0ij and Sij=sui9ij. While this neutrino mixing gives rise to several physical phenomena, the most 
experimentally fruitful one so far has been neutrino oscillations, whereby the flavor composition of a neutrino can 
vary as it propagates. The probability P that a relativistic neutrino produced with flavor I will be detected as a 
neutrino of flavor I' after traveling a distance L through vacuum is given by 


P{^i^vv) 


= 5a' - 


4^5R(C/;,C/zqC/y[/p^.)sin2 

i>j 


(^) 


+ 2^S5(C/;,UzqUyt/p^.)sin 

i>j 


(^) 


( 2 ) 


where E is the neutrino’s energy, Arri^j = mf — is the difference between the squares of the masses of the mass 
eigenstates v; and vj, Sw is the Kronecker delta, and the matrix elements Uu are defined by 
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Our quantitative understanding of the neutrino sector has grown tremendously in the past two decades primarily due 
to neutrino oscillation experiments. The “solar” mass splitting Am^i [1] and the thirty-times-larger “atmospheric” 
mass splitting Am §2 [5] have been measured with impressive precision: 

Amii = (7.53 ± 0.18) x 10"^ eV^ (4) 

Am ^2 = (2-34 ±0.09) X10-3 eV^ [NH] or (-2.37tJ);‘i’J)x lO-^ eV^ [IH] . (5) 

The normal hierarchy (NH) and inverted hierarchy (IH) distinction is defined below. [HT] In addition to these mass 
splittings, we know at varying levels of precision the values of the three mixing angles 0 i 2 , ^ 13 , and 623 [5]. We do 
not have good information about the sign of the atmospheric mass splitting, the complex phases in the PMNS matrix 
of Eq. Q, the absolute masses {mt} of the neutrinos, or whether neutrinos are Majorana particles. Furthermore, 
several experimental anomalies suggest that extensions to the neutrino standard model may be needed [7]. 

The scope of this review is limited to the experimental prospects for measuring the unknown sign of the atmospheric 
mass splitting or, equivalently, whether V 3 is the heaviest or lightest mass eigenstate. These two possibilities are 
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termed, respectively, the normal and inverted neutrino mass hierarchies, and Figure[^shows the two cases graphically. 
Some authors prefer the term “mass ordering” over “mass hierarchy” when a third case - a degenerate spectrum - 
is relevant to the context. In a degenerate spectrum, the mass of the lightest neutrino is large compared to the 
individual mass differences, i.e. Of note, neutrinoless double beta decay rates depend strongly on 

the mass hierarchy only if the mass spectrum is non-degenerate [S]. In more general contexts, authors regularly use 
“ordering” and “hierarchy” interchangeably. 

The precision of neutrino sector measurements has reached a point where the unknown hierarchy is a major hurdle 
to further progress. Knowledge of the hierarchy is also vital to interpretation of neutrinoless double beta decay results, 
serves as an input to cosmological and astrophysical measurements, and is a powerful discriminant among unification 
and neutrino mass models. The neutrino mass hierarchy may also be the next major unknown of the Standard Model 
to be measured. In this article we review the experimental outlook for making this measurement. 
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FIG. 1: The two possible neutrino mass hierarchies. The colors represent the approximate flavor admixtures present in each 
mass eigenstate. In the normal case, the masses exhibit a hierarchical pattern, and the lightest nentrino has the largest 
admixture of electron flavor. The relative ordering m 2 >m\ is known through observations of solar neutrinos, which are subject 
to resonant matter effects in the sun [U Iflllllj. 


II. MAKING THE HIERARCHY MEASUREMENT 
A. Physical principles 

Several distinct approaches for probing the mass hierarchy are in use or under consideration by current and future 
experiments. One approach is motivated by the vacuum oscillation probability in Eq. Q. Since this probability 
depends on each of the mass splittings , a sufficiently precise measurement of T’(E) for a given set of initial 

and final flavors I and I' can determine whether Am '^2 slightly larger or smaller than Am§]^. This approach suits 
reactor-sourced Vg —>■ Tg oscillation measurements, as low energy neutrinos (few MeV) are needed to obtain significant 
contributions from the Ato|]^ terms at distances L that are short enough (tens of kilometers) to still have useful event 
rates at the detector. 

For neutrinos traveling through matter, the presence of electrons in the propagation medium modifies the Hamil¬ 
tonian for Vg differently than it does for and I'r [SHE]. Qualitatively, the effective masses of the i/g-dominated vi 
and 1^2 states are increased (or decreased for antineutrinos) by the presence of the electrons, and the kinematic phases 
AmfjL/{4:E) follow suit. This either compresses or expands the effective mass splittings, depending on the hierarchy, 
modifying the oscillation probabilities in a observable way. 

In a different arena, cosmological measurements of large-scale structure, the cosmic microwave background, and its 
H-mode polarization provide access to the hierarchy via the sum of the neutrino masses mui]. In particular, were 
these data to yield 'Y^rrii < 2^J\Am^\ ~ 100 meV, the hierarchy would have to be normal. A analogous constraint 
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can be made for the effective electron neutrino mass measured in beta decay experiments, but upcoming experiments 
will not have the mass sensitivity required to provide hierarchy information even if the necessary conditions on the 
mass spectrum (normal hierarchy and small mi) are met. 

Neutrinoless double beta decay searches will rely on external knowledge of the hierarchy to interpret observed (or 
limited) decay rates [5] and will not provide independent measurements of the hierarchy themselves. Likewise, the 
neutrino fluxes from supernovae carry signatures of the hierarchy due to both matter effects and collective (high 
neutrino density) effects |15H17] . but uncertainties in the astrophysical models make reliable determination of the 
hierarchy through supernova neutrino observations unlikely even if a nearby supernova event were to occur in the 
near future. 

In the sections below, we describe specific experimental efforts toward measuring the mass hierarchy, discussing 
the advantages, possible pitfalls, and expected sensitivity of each. The collection is not exhaustive in that we exclude 
experiments that do not have notable hierarchy sensitivity or are unlikely to proceed due to programmatic, budgetary, 
or other practical considerations. 


B. Statistical issues 

First, it is worth devoting a few words to the statistical interpretation of the sensitivities. Though the question at 
hand can be stated quite simply - Is the hierarchy normal or inverted? - much discussion of its statistical treatment 
has taken place in recent years, largely due to the lack of a universally accepted definition of sensitivity. Three 
definitions appear most often in the literature. 

The first makes use of pseudo-data formed from the predictions for the experimental observables. More specifically, 
one of the hierarchies is taken as truth and the corresponding predicted observables are used directly as the pseudo¬ 
data, with no statistical or systematic fluctuations introduced. This data set, for which the name “Asimov” data 
set m is gaining traction in the literature, is then fit assuming the opposite hierarchy to find the minimum for 
this wrong hierarchy assumption, marginalizing over all free oscillation and systematic nuisance parameters. Since the 
correct assumption must produce for this data set, the minimum found for the wrong hierarchy represents 

a typical between the hypotheses. Motivated by the behavior of Gaussian systems, -\/A^ is then taken as the 
hierarchy determination sensitivity as a number of a. 

In a second approach, an ensemble of fully fluctuated data sets is simulated assuming one hierarchy and used to cal¬ 
culate the expectation value for the Ay^ between the incorrect and correct hierarchy hypotheses, again marginalizing 
over nuisance parameters for each simulated data set. The sensitivity is then quoted as (Ay^). 

The third common approach starts again with an ensemble of fluctuated data sets assuming one hierarchy. The 
sensitivity is quoted as the frequency with which the correct hierarchy hypothesis fits to a lower y^ than the incorrect 
one. This sensitivity may be given as a p-value directly or converted into the corresponding number of Gaussian a. 

Many variants of these approaches have also been used or proposed. A given author may choose one method over 
another to facilitate comparisons or combinations with past research, to promote a Bayesian or frequentist philosophy 
to reporting results, to make calculations simpler or computationally faster, or to ensure statistical accuracy. Refs. m, 
[IS] I [H] j and |22| serve as further introduction to these issues. We take a pragmatic view here, noting that on the whole 
these concerns tend not to lead to actionable differences in sensitivity estimates. These differences are quantified within 
the frequentist realm in m- Since frequentist and Bayesian techniques answer fundamentally different questions, 
cross-comparisons between them are more fraught, especially at low sensitivities. Nevertheless, the primary concerns 
that arise when examining sensitivity projections are rarely these fraction-of-a-cr effects but rather the broader issues 
of engineering or detector performance requirements, the treatment of systematic uncertainties, the realism of stated 
timelines, or the prospects for obtaining funding for the desired experimental scope. 

Another popular statistical question surrounding the mass hierarchy is: With what confidence must we establish it? 
In practice, each increment in hierarchy sensitivity adds importantly to our understanding. Even a 2 (t measurement 
would begin guiding the model-building community and would become a meaningful input in other areas of physics 
{e.g.^ cosmology and astrophysics). As the hierarchy significance increases, so too does the significance of any mea¬ 
surement that takes the hierarchy as input. Thus, while a 3cr hierarchy measurement will be a historic milestone for 
particle physics, a continued push to 5 ct or beyond is needed if we are to make >5 (t claims for other, quantitatively 
related questions {e.g., exclusion of Majorana neutrinos due to future non-observation of neutrinoless double beta 
decay). Whether those subsequent questions require high levels of significance is beyond the scope of this article. 
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III. EXPERIMENTS 
A. Long-baseline 

Long-baseline neutrino experiments use accelerator-based sources of or and large detectors some hundreds of 
kilometers downstream to investigate neutrino oscillations. These experiments probe the mass hierarchy through the 
matter effects that modify the appearance probability P{v^ —>■ Ve) and its antineutrino counterpart from their vacuum 
values. Neglecting higher-order terms in a = Am|]^/Am §2 ~ 0.03 and sin^(0i3) « 0.02, the transition 

probability for long-baseline experiments is given by |23) 


Pel’ll ^e) 


sin^ 6*23 sin^ 20i3 
-I- aJ cos(A ± S) 


2 on sin2(A(l-a;)) 
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sin(Aa:) sin(A(l — x)) 
x{l — x) 


( 6 ) 


o o „ o „ sin^lAa;) 

-|- cx cos 023 sin 2012 - 3 - 

x^ 


where J = cos 0i3 sin 20i3 sin 20i2 sin2023, A = Am32L/(4£’), x = P2^/2GpUeE /and the plus (minus) signs 
apply to neutrino (antineutrino) oscillations. In the definition of a;, Gp is the weak coupling constant and rig is the 
number density of electrons in the propagation medium. 

The mass hierarchy enters via the (1 — a;) factors in Eq. with x switching signs between the normal and inverted 
cases. To increase the scale of this perturbation, one must increase the neutrino energy E. However, one must also 
stay near the oscillation maximum of A « 7r/2 to have significant oscillation probability at all. Thus, the experimental 
baseline L must also increase, leading to the practical fact that an increased sensitivity to the mass hierarchy in the 
Ve and Ve appearance channels comes from having a longer experimental baseline. 

Figure shows how the appearance probability changes over the three baselines discussed here: 295 km (T2K), 
810 km (NOz/A), and 1300 km (DUNE). The probabilities are plotted for all possible values of the CF-violating phase 
5, tracing out ellipses in the figure. The uncertainty in 5 complicates the mass hierarchy measurement, although less 
so at the longest baselines since the hierarchy-dependent matter effects become larger than any possible (5-induced 
changes in the oscillation probabilities. Hierarchy sensitivities for long-baseline experiments are often quoted as a 
function of the true (if unknown) 5. The uncertainty on 023 is the next largest factor after 5, with the current allowed 
range of 023 leading to a peak sensitivity uncertainty of roughly 20%. This dependence is not always explicit in 
experiments’ official sensitivity plots, which often assume a fixed typical value for 023, say 7r/4. 

T2K and NOz^A are currently operational, and DUNE is a planned future experiment. Several concepts for a future 
long-baseline experiment, some with impressive hierarchy sensitivity, have been studied and pursued in recent years. 
However, most of these have remained virtual given the need for significant funding (approximately US$1B) and a firm 
programmatic commitment from a suitable facility. Today, two efforts have significant momentum. One is DUNE, a 
newly formed international collaboration that, in practice, merges several previously independent efforts. The other. 


Hyper-K, is discussed in Section HIB 


1. T2K and NOuA 


T2K [24] sends neutrinos from J-PARC to the Super-Kamiokande (Super-K) detector in the Kamioka mine 295 km 
away. The storied Super-K detector is a 50-kton (22-kton fiducial) water Cherenkov detector situated 44 mrad 
off the T2K neutrino beam axis. An off-axis arrangement, also used by NOz/A, yields a narrower neutrino energy 
spectrum with a reduced high-energy tail, thus decreasing the rate of neutral current events that form the dominant 
background for the signal Uf, charged current channel. In 2013, T2K made the first definitive observation of > Vg 
oscillations at 7.3cr using ~10% of the experiment’s eventual planned exposure [5S]. However, as Figure suggests, 
the relatively short baseline of T2K precludes determining the mass hierarchy from this Vg appearance data alone. 

NOz/A ^7] uses the NuMI neutrino source at Fermilab and a new 14-kton highly segmented tracking calorimeter 
810 km away and 14 mrad off the neutrino beam axis in Ash River, Minnesota. NOz/A began taking data with a 
fully deployed 14-kton detector in 2014. With its full planned exposure, NOz^A can determine the hierarchy at 2-3cr 
if the value of <5 falls in a favorable range. For unfavorable S values, NOz/A extracts correlated information about S 
and the hierarchy and thus does not determine the hierarchy outright. In these so-called “degenerate” cases, it helps 
a bit to combine NOz/A and T2K data since T2K is sensitive to S but largely insensitive to the hierarchy. Figure]^ 
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FIG. 2: P{yf_i —^ z7e) versus P{vti —>■ r'e) for both hierarchies (red 
and blue ellipses) and for the full range of 5 (cycling around 
the ellipses) for a representative L/E value of 0.4 km/MeV. 
At the T2K baseline, the neutrino and antineutrino oscillation 
probabilities differ very little between the two hierarchies and 
thus measurements of these probabilities offer minimal hierar¬ 
chy discrimination. For NOi^A, significant splitting of the two 
cases occurs. For DUNE, the separation is complete, allowing 
for unambiguous determination of the hierarchy regardless of 
5, assuming small enough measurement errors on the probabil¬ 
ities. This figure is illustrative only, as it keeps other oscillation 
parameters fixed and as a full experiment involves a range of 
neutrino energies. 


shows the projected reach of NOz/A both with and without the inclusion of T2K data. NOz^A is the only operational 
experiment with any significant hierarchy sensitivity, albeit with a possibility of confusion from 5. 

Both T2K and NOz^A use multiple detectors to mitigate systematic uncertainties. NOz^A uses identical detector 
technologies at its two sites at 1 km and 810 km. T2K uses a multipurpose, off-axis near detector at 280 meters 
together with a beam monitoring detector situated on-axis. The multi-detector approach makes the T2K and NOz/A 
experiments statistics limited for some time, so our knowledge of the hierarchy in the next five to ten years will be 
driven in part by the beam power achieved at Fermilab and, to a lesser extent, at J-PARC. The NuMI beam reached 
400 kW in early 2015 and is projected to reach the design 700 kW within two years, after the completion of upgrades 
to the Booster, which lies upstream in the accelerator chain. The J-PARC neutrino source is operating at ^325 kW, 
and significant future upgrades are needed to realize the planned 750 kW. 
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NOvA hierarchy resolution, 3+3 yr 



NOvA hierarchy resolution, 3+3 yr 



FIG. 3: (Left) Hierarchy sensitivity for the planned NOi^A exposure of 3.6x 10^^ protons on target (POT), plotted as a function 
of true 5. (Right) The same with the addition of 5.5x10^^ POT of T2K data. The T2K data help break the 5- 

vs.-hierarchy ambiguity somewhat, although outright hierarchy determination is still unlikely in the unfavorable regions of 
parameter space. Figure adapted from Ref. |28| . 


2. DUNE 

In the past decade, significant global effort has gone into designing a next-generation long-baseline experiment, and 
well-developed proposals have come out of this process. The most challenging aspect at present is budgetary. The 
full physics scope of a future experiment - precision measurements of PMNS matrix elements, CP violation, proton 
decay, supernova physics, and beyond-the-Standard-Model searches - requires a substantial detector and a beamline to 
match, at a combined cost surpassing US$1B. Such a program is only achievable through international partnerships, 
and a new collaboration, originally named ELBNF and since renames DUNE, has recently formed with the early 
goal of carrying forward the significant design work already completed to form a fully international long-baseline 
experimental proposal [29] . 

Eermilab will host the planned 1.2-MW neutrino source for DUNE, and the experiment’s far detector will be located 
in the Sanford Underground Research Facility (SURF) in South Dakota, yielding a baseline of 1,300 km. The far 
detector will be a liquid argon time-projection chamber (TPC) with a total mass of 40 kton and a fiducial mass around 
34 kton. One of the existing design efforts, LBNE, also assumed a Fermilab-to-SURF layout with a large liquid argon 
detector, so we use the LBNE sensitivity estimates here [30] as a proxy for DUNE. 

The ten-year hierarchy sensitivity for a 40-kton DUNE is substantial (Figure]^. Even at the least favorable 5 and 
023 values, DUNE achieves 5 (t significance and exceeds lOcr for other parameter assumptions. Although some open 
questions remain regarding near detector performance, far detector event selections, and systematic uncertainties, 
these hierarchy sensitivity estimates are unlikely to vary grossly as the open questions are addressed. The greatest 
uncertainty in the reach of DUNE is in the timeline and scope of the experiment. The collaboration aims to deploy 
an initial 10-kton detector underground at SURF by 2021, followed shortly by completion of the neutrino source and 
further deployment to 40 kton. These timescales are achievable but aggressive. The recent US high energy physics 
strategic plan m strongly supports this effort, and operation of a 10-kton detector and a 1.2-MW source is estimated 
to begin in 2025. How quickly additional detector mass can be added and how quickly a 1.2-MW beam can be 
deployed will play a large role in the sensitivities realized. To good approximation, the sensitivities of Figure]^ can 
be scaled according to the square root of exposure. 


B. Atmospheric 

Atmospheric neutrino experiments take advantage of the flux of neutrinos - a reasonably well-characterized mix of 
iZe, and Vg - produced by cosmic ray showers in the atmosphere. Since detectors can collect these neutrinos 
from all directions, the distance between neutrino production and detection can range from tens of kilometers for 
neutrinos produced above the detector to 13,000 kilometers for neutrinos produced below (ie., on the other side 
of the Earth). This wide range of baselines, together with a wide range of energies, makes atmospheric neutrinos 
particularly useful probes of neutrino oscillations. 
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Mass Hierarchy Sensitivity (NH) 


Beam, Signal/BG Unc^ 
CDR, 5%/10% 

80 GeV, 5%/10% 
-80 GeV, 1%/5% 



FIG. 4: Hierarchy sensitivity for DUNE at a fiducial exposure of (1.2 MW)x(34 kton)x(10 yr). The red band spans the range 
of assnmptions shown: (dotted) beam design in the LBNE conceptual design report and 5% (10%) uncertainties taken for the 
signal (background), (dashed) a further optimized beam design using 80 GeV protons, and (solid) the same as the dashed line 
but with more aggressive systematic uncertainties. Normal hierarchy is assumed. Figure adapted from Ref. m- 


Atmospheric neutrino experiments gain hierarchy sensitivity through the resonant-like enhancement of matter effects 
experienced by either neutrinos or antineutrinos (depending on the hierarchy) as they pass through the Earth |32H34) . 
The effect on the oscillation probabilities is often presented in so-called oscillograms (e.g., Figure]^. Experiments 
that can study neutrinos and antineutrinos separately need only determine which sample exhibits the matter effect 
enhancement. Experiments that cannot separate these must make a statistical inference based on knowledge about 
the incident neutrino and antineutrino fluxes. Current experiments, notably Super-K [35] and MINOS [3, have 
demonstrated the principle of this measurement, but these detectors are too small and thus have too few events to 
obtain significant hierarchy sensitivity, despite their decade-long exposures. Proposed future large detectors fall into 
three categories: water Cherenkov (PINGU, ORCA, Hyper-K), magnetized iron tracker (INO), and liquid argon TPC 
(DUNE). 


1. PINGU, ORCA, and Hyper-K 

PINGU (Precision IceCube Next Generation Upgrade) [31] is a proposed extension to the IceGube detector |37j at 
the South Pole. PINGU would add an array of 40 strings each with 60 optical modules to the DeepCore region of 
IceGube, spanning 4 Mton of ice. As the technology is well established, the cost and schedule estimates are robust, 
and operation could begin by 2020 if funding is established soon. The relatively high density of optical modules in 
PINGU versus the rest of IceGube would lower the triggering threshold to ^I GeV and would provide good energy 
and angle resolutions for events in the 5-15 GeV region in which the hierarchy signature lives. These resolutions are 
critical inputs to the sensitivity estimates, as the matter effects of Figure]^ are heavily smeared. 

PINGU projections have evolved significantly in the past couple of years. Detailed studies of detector performance 
and the effects of systematic uncertainties (notably energy scale; v and V cross sections; and the oscillation parameters 
Am| 2 , 6 * 23 , and @ 13 ) have been carried out [33]. Further, the prospects for improvements through detector geometry 
optimization, incorporation of event inelasticity as a weak neutrino/antineutrino discriminant |38j . and more advanced 
reconstruction techniques suggest that the current sensitivity estimates may be conservative. Figure [^ shows the 
sensitivity of the combined track (v^ charged current) and cascade (all other) channels in PINGU, which reaches 3 ct 
hierarchy sensitivity in 4 years of running if 023 is in the lower octant. 

ORCA (Oscillation Research with Cosmics in the Abyss) [39] is a proposed option for a multi-megaton water 
Cherenkov array based on the KM3NeT [ID] deep-sea technology. The measurement strategy is the same as for 
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FIG. 5: Oscillation probabilities —>■ {left) and P(l7^—>17^) {right) for neutrinos passing through the Earth for the 

normal mass hierarchy. The probabilities are plotted as a function of the neutrino energy E (expressed as log]^Q(i?/GeV)) and 
the direction of travel 0i/, with cos(0i/)=l corresponding to an upward neutrino trajectory into the detector. These neutrinos 
will have passed through the full diameter of the Earth. Trajectories at cos(0iy)=O correspond to the horizon. The matter 
effect enhancement is seen in P{Vfj, —>■ Vn) at 2 GeV < E < 15 GeV and is absent in ^(17^ —^17^). The situation is reversed for 
the inverted hierarchy. A corresponding effect occurs for the Ve and z7e appearance channels. Figure adapted from Ref. |34| . 


PINGU: use a dense array of optical modules to obtain adequate energy resolution, angle resolution, and event 
identification capabilities to observe the matter effect enhancement for neutrinos below 15 GeV. The latest ORCA 
sensitivity studies have included track and cascade channels, and the proposed detector has grown from 2 Mton with 
50 strings to 4 Mton with 115 strings. The stated ORGA sensitivities are on par with, or somewhat better than, those 
of PINGU, with 3cr sensitivity anticipated in 3 years [41]. However, the ORGA estimates are less well developed and 
documented than those of PINGU, so it is difficult to assess how they will evolve as work on systematic uncertainties 
and analysis techniques continues. 

The proposed Hyper-K detector [32] consists of two cylindrical water-filled tanks with a total mass of 1 Mton 
and with 20% photocathode coverage provided by 10,000 photomultiplier tubes. Hyper-K would be sited near the 
Kamioka mine and would also serve as the far detector for the long-baseline T2HK experiment, a scaled-up version of 
T2K. Like T2K, T2HK has limited hierarchy sensitivity on its own given its baseline, and thus Hyper-K’s hierarchy 
sensitivity comes primarily from atmospheric neutrinos. The excellent event reconstruction capabilities of the Hyper- 
K detector are well understood based on long Super-K experience. The most glaring uncertainty for Hyper-K is 
whether the project proceeds. Whereas PINGU and ORGA are relatively low-cost options (US$50M-$80M), Hyper-K 
would require US$500M-$700M over a seven-year construction timeline. Given the focus of the U.S. and European 
long-baseline communities on liquid argon TPC approaches, substantial funding may need to come from Japan alone. 
This higher cost over PINGU and ORGA, of course, brings with it a rich physics program including leptonic CP 
violation, proton decay, and a range of solar and astrophysical measurements |42j . 

Hyper-K can reach 3a mass hierarchy sensitivity after ten years of operation assuming a lower-octant value for 023 
of 0.4. This significance increases to 6cr (or, equivalently, the time to 3cr falls below three years) for an upper-octant 
value of 023 =0.6. This strong dependence of hierarchy reach on 023 is characteristic of all atmospheric experiments. 
The sensitivities for PINGU and ORCA quoted above take conservative values of 023 « 0.4. For 023 = 0.6, PINGU can 
reach 3tT sensitivity within its first year of operation and 6tT after four years. 


g. ICAL 

The ICAL detector at the India-based Neutrino Observatory (ICAL@INO) |33| is a planned 50-kton magnetized 
tracker made of 150 alternating layers of iron and resistive plate chambers. The 1.5-T magnetic field provides excellent 
/r“//i+ separation, and consequently v^jv^ separation, on an event-by-event basis. This advantage, together with 
the detector’s excellent energy and angular resolution, mitigates to some degree the lower event rates due to ICAL’s 
small fiducial mass. The ICAL hierarchy sensitivity after ten years of operation ranges from 2.5tT to 3.5 ct depending 
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PINGU livetime [yrs] 


FIG. 6: Mass hierarchy sensitivity for PINGU as a function of livetime. The solid red “multichannel” curve represents the best 
estimate. In the past, atmospheric experiments focused on the track events from and z7^ charged current interactions, which 
on their own lead to the bottom blue curve (“tracks”). More recently the power of the cascade events, particularly Ve. and 
charged current interactions, has been taken into consideration even for detectors with relatively poor particle identification 
capabilities. The deviation of the multichannel curve from a \fi behavior (dashed red) provides a measure of the impact of 
systematic uncertainties. Figure adapted from Ref. |36| . 


on the true value of 023 - 

ICAL@INO is moving ahead with support from India, and site work has begun. Operations are projected to begin 
in 2018, although given the required civil construction that date is likely optimistic. The detector technology is 
well-established and should not present problems scaling up to 50 kton. Continuing detector R&D is focused on 
performance and longevity optimization [44j . One concern is that the sensitivity is somewhat low compared with that 
of other experiments proposing results on a similar time frame (c. 2030). 


3. Large liquid argon TPC 


In contrast to Hyper-K, for which the atmospheric neutrino sample provides the bulk of the hierarchy sensitivity, 
dune’s atmospheric measurement is secondary to its accelerator-based measurement (Section III A 2). A 340-kton- 
year atmospheric exposure yields 3-5cr sensitivity, depending on 023- This can be compared with the accelerator-based 
measurement that bottoms out at Str in the extreme worst case. However, if the beam power of 1.2 MW is significantly 
delayed relative to the detector deployment, the DUNE atmospheric sample could play a role, especially because the 
detector’s excellent energy and angular resolution make it particularly good for this application. 


C. Reactor 

For the ~l-8-MeV Vf, streaming from a nuclear reactor, the first oscillation maximum for Amlj^-driven solar 
oscillations occurs at L^lOO km and for Am 32 -driven atmospheric oscillations at L^2 km. KamLAND has successfully 
exploited the former to measure 0i2 and Am^i [I]- More recently Daya Bay, RENO, and Double Chooz have exploited 
the latter to measure 0 i 3 and the effective squared-mass splitting Am^g [iSVHT] . 

At the longer of these baselines, the small amplitude and rapid (with respect to E) atmospheric oscillation will 
be superimposed on a deeper and slower solar oscillation. The relative phase of these oscillation modes reveals the 
relative sizes of Aml^ and Am'^ 2 - Figuredemonstrates the principle of the measurement for a baseline of 50 km. 
The broad solar dip in the Ve survival probability is visible together with the fast atmospheric oscillations, and flipping 
the sign of Am |2 shifts the period and phase of these oscillations in an observable way. 
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neutrino energy (MeV) 


FIG. 7: The oscillation probability Pive^Ve) as a function of neutrino energy for a fixed neutrino baseline of 50 km. The 
range of energies shown spans the relevant reactor flux. The wide dip centered at 3 MeV is due to the dominant solar 
oscillation, and the wiggles correspond to the sub-dominant atmospheric oscillation. The two curves differ only in the assumed 
sign of Am ^2 (and also in the magnitude of Am|i since the magnitudes of the other two splittings were held fixed.) 


Leveraging this effect requires a large, low-background detector with unprecedented energy resolution. Over the 
past decade, the experimental requirements have been extensively explored [48H54j . leading today to two experimental 
proposals, JUNO and RENO-50 [SSJ [HS]. JUNO (Jiangmen Underground Neutrino Observatory) proposes a 20- 
kton liquid scintillator detector with 3%/ ^/E/MeV energy resolution. For comparison, KamLAND used a 1-kton 
detector with an energy resolution of 6.4%/y/U/MeV [1]. The required advance in energy resolution is to come from 
increased photocathode coverage, higher-efSciency phototubes, and brighter yet more transparent scintillator. R&D 
on these fronts has led to encouraging progress, but significant work remains to demonstrate that this performance is 
achievable m- 

The pattern of the fast oscillations is also susceptible to systematic uncertainties. An error in the magnitude of 
Am ^2 or in the overall energy scale of the detector will skew the oscillation pattern in a similar, though not identical, 
way to flipping the hierarchy. Energy non-linearities are more pernicious, and Ref. |54j derives a requirement of <0.5% 
variation in energy scale across the spectrum. This requirement can be relaxed to ~2% (a performance demonstrated 
with the smaller KamLAND detector) if the functional form of the non-linearity is known at some level |55) . 

JUNO is sited equidistant from two primary reactor sites, each with multiple cores. Each core corresponds to 
a slightly different baseline, and the JUNO sensitivities take into account the resulting smearing of the oscillation 
pattern. With a 6-year run, JUNO projects a mass hierarchy sensitivity of 4cr, give or take 10% depending on the 
level of precision achieved by long-baseline experiments on the atmospheric mass splitting. JUNO has strong support 
from China and is moving ahead, with operations to start around 2020. If the detector R&D does not lead to the 
performance metrics required, JUNO’s hierarchy sensitivity could be considerably reduced. However, JUNO has an 
impressive precision oscillation physics program separate from the mass hierarchy measurement m- 

RENO-50 is a similar proposal, based in Korea, to use an 18-kton detector with performance requirements that 
are essentially identical to those of JUNO [S5]. While sensitivities of 3cr have appeared in the literature [5S], the 
calculations are notably less detailed than those of JUNO {e.g., accounting for the smearing from multiple reactors). 
Detector R&D funding for RENO-50 has been established, but start of operations by 2020 as proposed by the 
experiment’s proponents is likely optimistic. 


D. Cosmology 

Neutrinos in the early universe damped out density fluctuations on scales smaller than their free streaming length, 
which began decreasing as the neutrinos red-shifted toward non-relativistic energies. The masses of the neutrinos 
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governed the timing of the non-relativistic transition. Thus, these masses are imprinted on the matter power spec¬ 
trum today. Multiple experimental techniques are sensitive to this imprint, including cosmic microwave background 
anisotropy and B-mode polarization measurements, galaxy surveys, weak leasing, and the Lyman-a forest [Hill]. 

Cosmological measurements are sensitive primarily to the sum of the neutrino masses, not the individual masses. 
The most powerful constraints on this sum naturally comes from combinations of data sets, and current upper limits 
fall at ^ mi < 0.5 eV [ 6 ]. Upcoming terrestrial and space-based measurements that can improve on these limits 
include MS-DESI, Euclid, LSST, and a Stage-IV CMB project. All of these projects are relatively likely to proceed, 
with timescales for start of operations ranging from 2018 to 2022. The forecast uncertainties on ^ rui begin at 20 
meV (c. 2025) and improve to ^10 meV (c. 2030), with past data sets taken in combination with future ones 150 j . 

For the normal hierarchy, ^ rrii must be at least 0.06 eV given the oscillation measurements of Eqs. Q and ([^. For 
the inverted hierarchy, ^ m^ must be at least 0.1 eV. Thus if the hierarchy is normal and mi m 3 , the uncertainties 
given above for ^ mi translate into 2 ct sensitivity for determining the neutrino mass hierarchy by 2025 and 4 (t a 
few years later. The requirements on the mass spectrum for this measurement are critical: an inverted hierarchy or 
a smallest neutrino mass larger than ^0.04 eV eliminates entirely the hierarchy-determining power of cosmological 
observations. 


IV. OUTLOOK 

It is useful to summarize the experimental sensitivities given above in a time-ordered, rather than technique-ordered, 
manner. We skip here those options with underdeveloped estimates. 

• Today: Super-K (atmospheric), MINOS (atmospheric), and T2K (accelerator) have proof-of-principle results 
that constrain the mass hierarchy at an insignificant level. These sensitivities will not improve substantially 
with added exposure. 

• By 2020: NOizA will have most of its planned data set and can make a statement at 2-3cr for favorable values 
of 5. For unfavorable values, NOj^A’s hierarchy measurement will be correlated with its 5 measurement, and an 
unambiguous hierarchy determination will not yet be possible. In these cases, combining with T2K will help 
marginally. 

• By 2025: PINGU could provide 3-6cr sensitivity if the experiment is deployed in a timely manner. If the JUNO 
detectors perform as required, a partial JUNO exposure could provide 2-3tT sensitivity by this date. 

• By 2030: Some uncertain fraction (<50%) of DUNE’s 340-kton-year exposure could be available, yielding sen¬ 
sitivities of >3 ct. Similarly, Hyper-K could provide 2-4cr, and JUNO’s sensitivity would reach its design 4tT. 
Input from cosmology would be firmly in hand. ICAL could reach 3 ct by this time. 

• By 2035: If DUNE proceeds in a timely manner, a large fraction of its full exposure would be available, and the 
hierarchy would be definitively established at >5tT. Hyper-K would be at or beyond its design 3-6 ct. 

Given the complementarity of the various techniques, it is natural to explore combined sensitivities, and the literature 
contains a near-exhaustive set of combinations. The NOi^A and T2K combination is discussed in Section [III A[ above. 
Atmospheric and reactor experiments offer a particularly interesting synergy, as fits to the wrong hierarchy introduce 
different pulls on Am ^2 between the two techniques. The PINGU-I-JUNO case is explored in Ref. [50] . 

Setting aside the potential sensitivity boosts via combinations, however, it is critical to measure the hierarchy 
through independent techniques, not only for the added confidence in the results but also as protection against 
unfavorable parameters of nature. To wit, atmospheric sensitivities are largely independent of 6 , in contrast to 
accelerator sensitivities. Reactor sensitivities are invariant with respect to 023; in contrast to atmospheric sensitivities. 
Gosmological measurements are entirely free of dependence on oscillation parameters but do require a very particular 
mass spectrum. 

Table m summarizes the experimental situation and the prospects for determining the neutrino mass hierarchy in the 
coming years. The table lists the most critical concerns, some of which are major. However, if even just a couple of the 
promising experimental options becomes reality without delay, the outlook can be succinctly stated: A statistically 
significant (>95% GL) hierarchy determination is possible, but not guaranteed, by 2020; a statistically significant 
determination is likely by the late 2020s; and a definitive (>5 (t) measurement is possible by the late 2020s and likely 
by the early 2030s. 
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TABLE I: Prospects for measuring the neutrino mass hierarchy in the coming years. Sensitivity ranges are given to cover 
current uncertainties in other oscillation parameters. Comments and concerns are given in the last column, with the most 
worrisome specific concerns placed in italics. 


Experiment 

Hierarchy Approximate 

sensitivity timescale Comments and concerns 

NOzzA-hT2K 

l-3cr 

2020 

currently operating below designed beam power 

DUNE 

3-6ct 

2030 

funding, timeline 


5-lOa 

2035 


PINGU 

3-6(7 

2025 

funding; past systematics and resolution concerns largely addressed 

ORCA 

3-6(7 

- 

insufficiently developed at present 

Hyper-K 

3-6(7 

2030 

funding, timeline 

ICALOINO 

2-4ct 

2030 

timeline 

JUNO 


2027 

detector performance not yet demonstrated 

RENO-50 

~3cr 

- 

insufficiently developed at present 

Cosmology 

0-4(7 

2027 

Oct for most of allowed mass range; requires minimal NH spectrum 
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